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BACKGROUND OF THE INVENTION 

1. Field of the Invention 

This invention relates to the field of information processing systems, and in particular to 
information processing systems that utilize cache memory to minimize latency. 

2. Description of Related Art 

Cache systems are common in the art. A cache system comprises a cache memory and a 
corresponding controller that regulates the storage and retrieval of information to and from the 
cache memory. Traditionally, the cache memory is filled with copies of information resources 
that a user receives from a remote source, "remote" being defined as being further removed from 
the user than the cache memory, e.g., local main memory or a server in a client-server 
architecture. If the user subsequently requests the same resource, the resource's copy is provided 
from the cache memory, rather than from the original remote source, thereby saving the time 
required to receive the resource from the remote source for a second time. When the cache 
memory becomes full, the cache controller removes copies of the resources that have not been 
accessed recently, to make room for copies of new resources that the user accesses. A variety of 
criteria, commonly termed caching policies, are available to determine which resource copy to 
remove from the cache memory. Such caching policies can be based on: the duration since the 
last access, the number of times accessed since originally received, the amount of memory 
allocated to the resource, the difficulty of retrieving the resource from the remote site, etc. 

Cache systems are premised on the assumption that the information at the remote source 
has not changed when the resource copy in the cache is accessed. That is, the resource copy in 
the cache should not be used in lieu of the resource at the remote source if the resource at the 
remote source has changed. Thus, in addition to removing copies of resources from the cache 
memory when it is full, the cache controller in a conventional system also removes copies of 
resources from the cache memory when it is predicted or determined that the source information 
has changed, because the copy of the resource in the cache memory is outdated, or "stale". The 
prediction of whether a resource is likely to have changed is also often used in the selection of 
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which resource copy to remove when space becomes unavailable in the cache memory. For 
example, an image at a web-site maybe expected to change less often than text at a web-site, and 
thus, a cache controller for caching information downloaded from the Internet may retain 
downloaded image information for a longer average duration than downloaded text information. 

Each particular caching policy typically involves tradeoffs between: the likelihood that a 
user will re-access a particular resource copy rather than another resource copy; the likelihood 
that the particular resource copy is stale; the likelihood that the copies of other resources are 
stale; and so on. Consider, for example, a "Least Recently Used" (LRU) caching policy, and a 
"First In, First Out" (FIFO) caching policy. The LRU caching policy selects the copies of the 
resources to be removed from cache memory based on an assumption that if a user has not re- 
accessed a resource in a long time, it is likely that the user is not going to re-access the resource. 
The FIFO caching policy, on the other hand, is independent of how often or how recently the 
user re-accesses a resource, and is based on an assumption that the longer the resource copy has 
been in cache memory, the more likely it is to be stale. With a FIFO caching policy, copies of 
commonly re-accessed resources are often needlessly reloaded from the remote source, merely 
because they were in the cache memory longer than other, potentially rarely re-accessed, 
resources. Resources from an information source that contains information that changes 
frequently, on the other hand, is likely to be accessed by a user more frequently than an 
information source that contains relatively static information. If an LRU caching policy is 
employed, the cached copy will tend to remain in the cache memory too long, because it is 
frequently accessed. 

In general terms, copies of resources are received from a remote source, and the cache 
controller determines which copy to retain in cache memory in an attempt to minimize the 
latency delay for subsequent accesses to that resource, while at the same time attempting to 
maximize the correspondence, or synchronization, between the content of the copy in the cache 
memory and the content of the resource at the remote source. 
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BRIEF SUMMARY OF THE INVENTION 
It is an object of this invention to provide a method and system for, among other things, 
controlling a cache memory to minimize access latency. It is a further object of this invention to 
provide a method and system that optimizes the allocation of cache memory. 

These objects and others are achieved by providing a cache system that caches copies of 
resources based on the semantic type of the resource. A resource copy received from a remote 
source, e.g., from a server via the Internet, is categorized by its semantic type. The caching 
policy is customized for each semantic type, using different policies for different semantic types. 
The expression "semantic type" as used within this context refers to the different connotative 
meanings that the information contents of resources can have, as perceived by the user. For 
example, some information content may be perceived as highly volatile (e.g., being of short-term 
relevance such as web sites dedicated to the results of sport matches, to specific stock market 
news or currency exchange rates), other information content may be perceived as rather static 
(e.g., being of long-term relevance such as glossaries on the Internet). Semantic types that can be 
expected to contain dynamic information, such as news Web sites and weather Web sites, need a 
caching policy wherein the copy in the cache memory is selected for replacement based upon the 
duration of time that the copy has been in the cache memory. Conversely, semantic types that 
can be expected to relate to static resources, such as encyclopedic information, glossaries, etc., 
need a more conservative caching policy, such as least- recently-used (LRU) or least-frequently- 
used (LFU), that are substantially independent of the time duration that the copy remains in the 
cache memory. Additionally, some semantic types, such as communicated news messages in 
popular newsgroups or e-mail messages in e-mail archives may employ a combination of 
caching policies wherein the copy of the resource, or copies of parts of the resource, are initially 
identified as dynamically changing, then Jess dynamic, then static. 

The relationship between semantic content type and caching policy to be associated with 
the type can be determined in advance, e.g., by the resource provider, or may be determined 
directly by the user, or could be based, at least partly, on user-history and profiling of user- 
interaction with the resources. 
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The invention also relates to a method of enabling interaction with an information 
resource, e.g., as supplied by a service provider on the Internet. The method comprises enabling 
receipt of a copy of the information from the information resource; and enabling caching the 
copy according to a caching strategy dependent on a semantic type of the information. The 
enabling of the caching comprises, for example, supplying an indication representative of the 
semantic type of an Internet Web site. The indication can be a meta tag that with an indication 
that gets interpreted at the user's client for use as a cache control parameter. 

BRIEF DESCRIPTION OF THE DRAWINGS 
The invention is explained in further detail, and by way of example, with reference to the 
accompanying drawings wherein: 

FIG. 1 illustrates an example block diagram of a semantic caching system of the invention. 
FIG. 2 illustrates an example flow diagram for satisfying a user request using a cache memory 
system in the invention. 

FIG. 3 illustrates an example flow diagram for the storage of resources in a cache memory 
system in the invention. 

Throughout the drawings, same reference numerals indicate similar or corresponding 
features or functions. 

DETAILED DESCRIPTION OF THE INVENTION 
FIG. 1 illustrates an example block diagram of a semantic cache system 100 in the 
invention. The cache system 100 includes a cache controller 1 10, a cache memory 120 that is 
partitioned into different caches 121-129, and a set of parameters and rules 115 associated with 
each cache 121-129 and each semantic category. Physically, the cache memory 120 may be 
distributed among a variety of storage devices, or it may be a single block of memory, a partition 
of a disk drive, and so on; the caches 121-129 are logical partitions of the cache memory 120. 
Each cache 121-129 has a different caching policy. Cache 121 is a highly active cache, while 
cache 129 is a very stable cache; optionally, other intermediate activity cache partitions 125 are 
also provided. According to the invention, a copy of a resource is placed in a particular cache 
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121-129 in dependence upon the semantic type of the resource. Each of the caches has a 
corresponding set of parameters and rules 1 15, or caching policies, that control the storage 
duration and replacement of resource copies for the particular cache. A highly active cache 121, 
for example, employs replacement rules wherein the staleness of the content (the time since the 
copy was retrieved from the remote source) is the primary criterion used for determining which 
copy to replace. In a preferred embodiment of this invention, the parameters and rules 115 
associated with the highly active cache 121 also impose a maximum staleness duration for each 
cached copy. Conversely, a very static cache 129 uses a more conventional set of parameters and 
rules 1 15 to effect, for example, a Least Recently Used (LRU) replacement strategy that is 
independent of the staleness of the resource's content. 

In accordance with the invention, the semantic type of the resource determines in which 
one of caches 121-129 to place a specific copy of a resource that is retrieved, or downloaded, 
from a remote source, such as a site on the world-wide-web 180. For example, a weather report 
will be placed in a highly active cache 121, whereas an article from an encyclopedia will be 
placed in the very static cache 129. In like manner, copies of resources of other semantic types, 
such as news articles, stock reports, search results, e-mail messages, and so on, will each be 
allocated to an appropriate one of caches 121-129, based upon the dynamics of the semantic 
type. 

A request processor 150 adds and retrieves the resource material 155 to and from the 
cache system 100 in response to user requests 151. 

FIG. 2 illustrates an example flow diagram for satisfying a user request 151. Based 
initially upon the user request that is received, at 210, the semantic type of the requested resource 
is determined, at 220, via the semantic classifier 160. A variety of techniques are available for 
determining the semantic classification. In an embodiment of this invention, the user is queried 
for the category in which to place the requested information; thereafter, similar requests are 
categorized in like manner without an explicit query. Similarly, default semantic categories may 
be defined for commonly occurring request styles or formats, or based on context, such as the 
application from which the request is generated. Co-pending U.S. patent application 
"CONTEXT-BASED AND USER-PROFILE DRIVEN INFORMATION RETRIEVAL", U.S. 
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Serial No. 09/104,491 (attorney docket PHA 23,422), filed 6/25/98 for Chanda Dharap, and 
incorporated herein by reference, relates to enabling a user to navigate through an electronic data 
base in a personalized manner, wherein a context is created based on a profile of the user. The 
profile is based on topical information supplied by the user in advance, as well as a history of 
5 previous accesses of the user to the data base. In like manner, the user profile, context, and prior 
requests can be used in this invention to determine a semantic type with minimal interaction 
required from the user. Co-pending U.S. patent application " COOPERATIVE TOPICAL 
SERVERS WITH AUTOMATIC PREFILTERING AND ROUTING", U.S. Serial No. 
09/221,951 (attorney docket PHA 23,606), filed 12/28/98 for Doreen Cheng, relates to an 
10 information organization and retrieval system that organizes documents for rapid and efficient 
search and retrieval based upon topical content, and is incorporated herein by reference. The 
information organization and retrieval system is optimized for the organization and retrieval of 
O only those documents that are relevant to a given set of predefined topics. If a document does not 
jjt] have a topic that is included in the given set of topics, the document is excluded from the 
1 5 ; J provided service. In like manner, if a document includes a topic that is specifically banned from 
[1 the provided service, it is excluded. In this paradigm, the provider purposely limits the scope of 
'% the provided search and retrieval services, but in so doing provides a more efficient and effective 
~_ service that is targeted to an expected user demand. The information organization and retrieval 
m system also supports context-sensitive search and retrieval techniques, including the use of 
20}Z predefined or user-defined views for augmenting the search criteria, as well as the use of user 
yQ specific vocabularies. In a preferred embodiment, the select set of topics are organized in 

multiple overlapping hierarchies, and a distributed software architecture is used to support the 
topic-based information organization, routing, and retrieval services. Documents may be relevant 
to one or more topics, and will be associated with each topic via the topical hierarchies that are 
25 maintained by the information servers. U.S. Serial No. 09/221,951 refers to statistically based 
algorithms, neural nets and genetic algorithms, and the like, all known in the art, for automatic 
categorization of documents. 

A default semantic type may also be associated with each resource. For example, a 
service provider may pre-categorize resources as to their semantic type, by using a default 
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association dependent upon the source of the resource, or by incorporating a semantic 
determinator into the "web-crawlers" commonly used to organize resources by content upon 
retrieval. In like manner, a web-site manager may pre-categorize each resource available at the 
web-site. In general terms, a database that contains resources, or indexes to resources, may be 
5 configured to also contain a default semantic type associated with each resource, wherein the 
term database is used in the general sense to include any organized collection of material, 
including the Internet and the World- Wide- Web. Via such a database, the default semantic type 
is used except where the user specifically changes the semantic type, or where the user's prior 
behavior implies a change to the semantic type, or where another semantic type determinator, 
10 such as those discussed above, provides a different result. 

At block 230 of FIG. 2, the cache system 100 of FIG. 1 is queried by the request 
processor 150 to determine whether the request can be satisfied by the cache system 100. The 
0 cache controller 110 responds to this query based upon the current status of the requested 
u\ resource and the parameters and rules 115 associated with the particular cache. If the requested 
15 "J resource is not at the cache 100, either because a copy of the resource was never placed in the 
JJ! cache memory 120, or because the copy has subsequently been removed from the cache memory 
% 120, the cache controller 110 notifies the request processor 150 that the request cannot be 
5 _ satisfied. Additionally, in accordance with this invention, the cache controller 110 determines the 
fg suitability of the resources that are currently in the cache memory, based on the parameters and 
20™ rales 1 15 associated with each of the caches 121-129. For example, the highly active cache 121 
*D may be used, for example, to store retrieved stock prices, and may have a "staleness" parameter 
115 that specifies that any resource copy contained in the cache 121 that is older than fifteen 
minutes is deemed "stale", and irretrievable. An intermediately active cache 125 may be where 
copies of resources that are associated with a "news" semantic type are stored, and may have a 
25 staleness parameter 1 15 that specifies, e.g., a two hour limit before a resource is deemed stale 
and irretrievable. The static cache 129 will typically not have a staleness parameter associated 
with it, and will typically contain copies of resources that are associated with encyclopedic and 
other semantic types that are substantially unchanged over time. 
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As noted above, each semantic type category has associated rules and parameters 1 15 for 
organizing and retrieving each resource copy within the cache memory 120. In a preferred 
embodiment, these rules and parameters 115 also include rules for transferring resource copies 
from one cache to another. Using the aforementioned news semantic type as an example, the 
5 rules 115 associated with the receipt of a news resource could be to place the resource copy in 
the highly active cache 121 until it is determined to be stale, or until it must be removed to make 
room for newer active material, and then move it to the intermediately active cache 125, then to 
the highly static cache 129. Such a process is termed herein as a percolating cache process, 
wherein the resource copy eventually bubbles up through the active caches and is deposited into 
10 the static cache. The percolating cache process may also contain a selection mechanism to 

determine that only select items are percolated into static cache, the remainder being discarded, 
or, conversely, to determine that select items are discarded, the remainder being percolated into 
0 static cache. 

iTjj As is evident to one of ordinary skill in the art in view of this invention, prior art cache 

15 2 optimizations may also be employed within each of caches 121-129. For example, the rules for 
m removing items from active cache 121 may be structured such that copies of relatively stable 
% resources, such as images, remain in cache longer than more dynamic, or more easily retrievable, 
" resources, such as text. In this manner, for example, if a news resource copy is discarded, then 
fl subsequently re-accessed, only the text and additional images need be re-downloaded, based on 
20!Z the assumption that previously downloaded images contained in the resource will not have 
changed. 

Returning to FIG. 2, at 240, if the cache system 100 can comply with the user request 
151, the resource copy is retrieved from the cache memory 120, at 250. If the user request 151 
cannot be satisfied via the cache system 1 00, the resource copy is retrieved from the remote 
25 source, such as the Internet and the World- Wide- Web, or other network source, at 260. 

Optionally, at 270, the semantic type of the retrieved resource copy can be re-determined, 
or reassessed, based on the actual content of the resource. Co-pending U.S. patent application 
"COOPERATIVE TOPICAL SERVERS WITH AUTOMATIC PREFILTERING AND 
ROUTING", U.S. Serial No. 09/221,951, filed 12/28/98 for Doreen Cheng, referred to above, 
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relates to classifying a document into one or more select topical clusters, based on the material 
contained within the document. For example, a user request 151 of FIG. 1 could have been 
determined to correspond to a news type, and the retrieved news resource copy 185 may contain 
a weather advisory. In accordance with this aspect of the invention, the retrieved news resource 
5 copy, or a portion thereof, may be stored in accordance with the rules 1 1 5 associated with 

weather related resources. In like manner, within each semantic type, other related criteria may 
be used to determine the appropriate cache for each resource copy. For example, some news 
resource copies may be archived in the static cache 129, while others may be temporarily stored 
in the highly active cache 121 . For example, if the user request for a news resource is determined 
10 to be a request for current news, based, for example, on the aforementioned user profile, context, 
and prior requests, the retrieved information will be placed in the active cache 121 . If, on the 
other hand, based on the context of the user request, the request processor 150 determines that 
O the user request for a news resource is for any news on a particular subject, the retrieved news 
l'T| resource copies may be further segregated based on the content or other identification of the 
15 ^! resource, and each resource copy will be placed in an appropriate one of caches 121-129 based 
11 on this further segregation. In a preferred embodiment, the origination date associated with each 
"15 retrieved news resource copy is used to place copies of recently created news resources in active 
; cache 121, and copies of older news resources in static cache 129. Conversely, if the semantic 
|B type associated with the resource is encyclopedic, the origination date would be substantially 
20 Li irrelevant to the determination of in which one of caches 121-129 to place the resource copy. 
yD The retrieved resource copy is stored in the appropriate one (or ones) of caches 121-129 

of the cache memory 120, at 280, and the process continues, at 290. Referring to FIG. 1, the 
retrieved resource copy 185, 155, from either the network 180, or the cache 100, respectively, is 
presented to the user via the presentation jdevice 190. As illustrated in FIG. 1, the presentation 
25 device 190 is typically a display device, although other presentation devices, such as an audio or 
other sensory presentation device could be utilized, depending upon the particular resource being 
presented, or the preference of the user, for example, via a text-to-speech or speech-to-text 
process. 
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FIG. 3 illustrates an example flow diagram for the storage process of the cache system 
100. In accordance with the invention, the operation of the cache system 100 depends upon the 
semantic type of the resource copy being stored. As discussed above, the semantic type is 
provided to the cache system 100 by the request processor 150, and is illustrated as an input 305 
to the process of FIG. 3. For ease of understanding, the example flow diagram of FIG. 3 applies 
to a cache memory 120 that is partitioned into an active cache 121 and a static cache 129 only. 
The addition of one or more intermediately active caches 125, with varying degrees of dynamics, 
or staleness criteria, will be clear to one of ordinary skill in the art in view of this disclosure. 

The semantic type 305 is used to determine the parameters and rules 115 associated with 
the storage of the resource, at 310. As illustrated in FIG. 3, the semantic type in a preferred 
embodiment determines whether the caching is to be active, static, or percolating. If the caching 
is static, at 315, the static cache 129 is checked to see if sufficient unallocated memory is 
available to store the resource copy, at 355. If sufficient unallocated memory is not available, a 
currently stored resource copy in the cache 129 is selected for removal, at 360. As noted above, 
the removal of a resource copy from static cache 129 is based on criteria that are substantially 
independent of the staleness of the resource. Any one of a variety of removal criteria may be 
employed, such as the removal of the least recently used (LRU) resource copy, the removal of 
the least frequently used (LFU) resource copy, and so on. After identifying or creating sufficient 
unallocated memory, the resource copy is stored in the static cache memory 129, at 370. 

If, at 315, the caching is active or percolating, the active cache memory 121 is checked to 
see if sufficient unallocated memory is available to store the resource copy, at 325. If sufficient 
active storage is not available, previously stored resource copies are removed from the active 
cache memory 121, at 330, to provide sufficient unallocated active memory for storing the new 
resource copy. As discussed above, the criteria for selecting a resource to remove from the active 
cache 121 are highly dependent upon the staleness of each resource copy in the active cache 121, 
That is, because the copies of resources are placed in the active cache 121 based on the 
anticipated dynamics associated with their semantic type, older copies are preferably removed, 
independently, for example, of how recently, or how often, the particular resource has been re- 
accessed. If, at 335, the previously stored resource copy was stored in active cache as the first 
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phase of the aforementioned percolating process, it is transferred to the static cache 129, using 
the process described above, starting at decision block 355. The current resource copy is stored 
in the identified, or created unallocated, active cache, at 340. After storing the resource copy in 
the appropriate cache 121, 129, the process continues, at 390. 

The foregoing merely illustrates the principles of the invention. It will thus be 
appreciated that those skilled in the art will be able to devise various arrangements which, 
although not explicitly described or shown herein, embody the principles of the invention and are 
thus within its spirit and scope. For example, the particular semantic type classifications used can 
be customized to each user or each application. The amount of cache memory 120 allocated to 
each cache 121-129 can be dynamically reallocated based on the effectiveness of the cache 
system 100 in fulfilling user requests. Also, the semantic type can be provided to the network 
retriever 170 to also reduce latency or improve retrieval efficiency and effectiveness by 
facilitating a network retrieval using a directed search based on the semantic type. 

The specific structure and control flow illustrated in the figures are presented for 
illustrative purposes, and alternative structures and flows are feasible. For example, the function 
of the semantic classifier 180 may be embodied in the cache system 100, the request processor 
150, or the network retriever 170. In like manner, all of the resource copies, for example, may be 
contained in a single cache structure, the logical partitioning of the cache memory being effected 
by the application of rules and parameters 115 that incorporate the dynamics of the semantic type 
as parameters associated with each stored resource copy. For example, each resource copy in the 
cache may have an associated individual staleness criterion, such as a maximum time duration in 
cache memory, that is determined by the semantic type. The single caching policy determines 
which resource copy to remove based on this staleness criterion, as well as other, more 
conventional cache criteria such as LRU, LFU, and so on. The determination of a resource copy 
to be removed in this embodiment is based upon a weighted averaging scheme that is strongly 
influenced by the staleness of resources having a low staleness criterion. In this manner, dynamic 
resources, having a low staleness criterion, are selected more often for replacement as their time 
duration in cache increases. More stable resources that have a high staleness criterion are 
selected for removal from cache based on convention criteria, such as LRU and LFU, after stale 
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copies of dynamic resources are no longer present in the cache. In like manner, a percolation 
parameter can be associated with each resource. The percolation parameter contains a non-zero 
value for resource copies that are to be percolated into less and less active storage, and a zero 
value for non-percolating resources. Periodically, the percolation parameter of each resource is 
added to each resource's staleness criterion, thereby periodically decreasing the likelihood of 
removing each percolating resource copy based on staleness. These and other system 
configuration and optimization features will be evident to one of ordinary skill in the art in view 
of this disclosure, and are included within the scope of the following claims. 
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CLAIMS 

I claim: 



1. A method of processing an information resource, the method comprising: 




receiving a copy of the information resource from a remote source, and 
caching the copy of the information resource in dependence upon a semantic type 
associated with the information resource. 

2. The method of claim 1, wherein 

the caching of the copy comprises at least one of: a static caching, an active caching, and 
a percolating caching. 

3. The method of claim 1, further including: 

receiving a recpiest from a user, and 

determining the semantic type based on the request from the user. 

4. The method of claim 3, wherein 

the determining of the semantic type includes at least one of: 
determining a context of the request, 
determining a prior request from the user, 
determining a profile of the user, and 

determining a response from the user to a result of a prior request. 

5. The method of claim 1, further including: 

determining the semantic type based on an information content of the resource. 

6. The method of claim 1, wherein 

the remote source comprises an Internet site. 
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7. A method of enabling interaction with an information resource, the method comprising: 

enabling receiving a copy of the information from the information resource, and 
enabling caching the copy according to a caching strategy dependent on a semantic type 
of the information. 

8. The method of claim 7, wherein: 

the enabling of the caching comprises supplying an indication representative of the 
semantic type. 

9. The method of claim 7, wherein the information resource comprises an Internet Web site. 

10. A method of processing an information resource copy, comprising: 

determining at least one parameter associated with a semantic type of the resource copy, 
and ^ 

processing the resource copy in dependence upon the at least one parameter associated 
with the semantic type of the resource copy. 

1 1 . The method of claim 10, wherein 

the at least one parameter associated with the semantic type of the resource copy 
comprises a threshold for a time duration, and 

the method further comprises: 

precluding the retrieval of the resource copy if the time duration of the resource 
exceeds the threshold. 

12. An information processing system comprising: 

a processor that receives a request from a user for an information resource, 
a retriever, operably coupled to the processor, that facilitates a reception of a-copy of the 
resource from a remote site, and 
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a cache system, operably coupled to the request processor, that facilitates a storage and a 
retrieval of the copy of the resource, the cache system comprising: 
a cache memory for storing the copy, and 

a cache controller operably coupled to the cache memory for control of the 
storage and retrieval of the copy in the cache memory in dependence upon a semantic type of the 
resource. 

13. The system of claim 12, wherein 

the cache memory^omprises a plurality of cache sections, and 
the cache controller selectively accesses a specific one of the cache sections in 
dependence upon the semantic type of the resource. 

14. The system of claim 12, further including 

a semantic classified operably coupled to the request processor, that determines the 
semantic type of the resource. 

15. The system of claim 14, wherein 

the semantic classifier determines the semantic type based on at least one of: a context of 
the user, a prior request dfthe user, a profile of the user, and a response from the user to a result 
of a previous request from the user. 

16. The system of claim 14, wherein 

the semantic classifier determines the semantic type of the resource based on a material 
content of the resource. 

17. A database comprising: 

a plurality of indexes corresponding to a plurality of information resources, and 
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a plurality of default semantic types corresponding to the plurality of information 
resources that facilitates a caching of each information resource of the plurality of information 
resources based on the default semantic type of each information resource. 

18. The database of claim 17, wherein the plurality of information resources includes an Internet 
Web site. 



19. The database of claim 17, wherein the plurality of information resources includes resources 
that are available via an Internet service provider. 

20. A Web page comprising: 

an information resource, and ^ 
an identification of a semantic type associated with the information resource for 
controlling an automatic processing of the Web page. 



\ 
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ABSTRACT OF THE DISCLOSURE 



Resources are cached based on the semantic type of the resource. The cache management 
strategy is customized for each semantic type, using different caching policies for different 
semantic types. Semantic types that can be expected to contain dynamic information, such as 
news and weather, employ an active caching policy wherein the resource in the cache memory is 
chosen for replacement based on the duration of time that the resource has been in cache 
memory. Conversely, semantic types that can be expected to contain static resources, such as 
encyclopedic information, employ a more conservative caching strategy, such as LRU (Last 
Recently Used) and LFU (Least Frequently Used) that is substantially independent of the time 
duration that the resource remains in cache memory. Additionally, some semantic types, such as 
communicated e-mail messages, newsgroup messages, and so on, may employ a caching policy 
that is a combination of multiple strategies, wherein the resource progresses from an active cache 
with a dynamic caching policy to a more static caches with increasing less dynamic caching 
policies. The relationship between semantic content type and caching policy to be associated 
with the type can be determined in advance, or may be determined directly by the user, or could 
be based, at least partly, on user-history and profiling of user-interaction with the resources. 
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